Options for Modelling Temporal Statistical Dependencies in an Acoustic Model for ASR
نویسندگان
چکیده
In this paper we consider the combination of hidden Markov models based on Gaussian mixture densities (GMM-HMM) and linear dynamic models (LDM) as the acoustic model for automatic speech recognition systems. In doing so, the individual strengths of both models, i.e. the modelling of long-term temporal dependencies by the GMM-HMM and the direct modelling of statistical dependencies between consecutive feature vectors by the LDM, are exploited. Phone classification experiments conducted on the TIMIT database indicate the prospective use of this approach in continuous speech recognition.
منابع مشابه
N-gram Based Modelling of Time and Feature Dependencies in HMM
Developing new acoustical models that overcome HMM based modelling restrictions is an active field of research in automatic speech recognition. In this paper two approaches are presented. The first one consists on exhaustively modelling the dependencies between the acoustic features used to parametrize the speech signal, improving the traditional use of dynamic features. N-gram based modelling ...
متن کاملNATIONAL UNIVERSITY OF SINGAPORE School of Computing PH.D DEFENCE - PUBLIC SEMINAR
Automatic Speech Recognition (ASR) has been one of the most popular research areas in computer science. Many state-of-the-art ASR systems still use the Hidden Markov Model (HMM) for acoustic modelling due to its efficient training and decoding. HMM state output probability of an observation is assumed to be independent of the other states and the surrounding observations. Since temporal correla...
متن کاملModelling of Resonance Frequency of MEMS Corrugated Diaphragm for Capacitive Acoustic Sensors (TECHNICAL NOTE)
In this paper, a new model for resonance frequency of clamped circular corrugated diaphragm has been presented. First, an analytical analyzes has been carried out to derive mathematic expressions for mechanical sensitivity of diaphragm with residual stress. Next by using Rayleigh's method we present mathematical model to calculate the resonance frequency of corrugated diaphragm and investigate ...
متن کاملOn the Use of Augmented Hmm Models for Overcoming Time and Parameter Independence Assumptions in Asr
There is significant interest in developing new acoustic models for speech recognition that overcome traditional HMM restrictions. In this work, we propose to use Ngram based augmented HMMs. Two approaches are presented. The first one consists on overcoming the parameter independence assumption. This is achieved by modeling the dependence between the different acoustic parameters, using N-gram ...
متن کاملEvaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles
One of the challenges in automatic speech recognition is how to handle pronunciation variation. The main causes for pronunciation variation are the speaker (voice characteristics, accent, non-nativeness etc.) and the speaking style (reading, spontaneous responses, conversation etc.). An ASR system has basically two options for modelling the variation on the word and sub-word level: lexical mode...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010